Abstract

We consider the so-called measure once finite quantum automata 
model introduced by Moore and Crutchfield in 2000. We show that 
given a language recognized by such a device and a linear context- 
free language, it is recursively decidable whether or not they have a 
nonempty intersection. This extends a result of Blondel et al. which 
can be interpreted as solving the problem with the free monoid in place 
of the family of linear context-free languages. 



1 Introduction 



Quantum finite automata, or simply quantum automata, were introduced at
the beginning of the previous decade in [10] as a new model of language
recognizer (cf. [3]). Numerous publications have since compared their
decision properties to those of the older model of probabilistic finite
automata. Some problems that are undecidable for probabilistic finite
automata turn out to be decidable for quantum finite automata. The result
in [5] which triggered our investigation can be viewed as asserting that
the intersection emptiness problem of a language recognized by a finite
quantum automaton with the free monoid is recursively decidable. The
present result concerns the same problem where, instead of the free
monoid, one considers more generally a language belonging to some
classical family of languages, such as the context-free languages or the
bounded semilinear languages.

*The third author acknowledges the support of fundings "AST 2009" of the University

An ingredient of the proof in [5] consists of expressing the emptiness
problem in the first-order theory of the reals and then applying
Tarski-Seidenberg quantifier elimination. This is possible because an
algebraic subset, i.e., a closed subset in the Zariski topology
$A \subseteq \mathbb{R}^n$, is naturally associated to this intersection
and, even more miraculously, because this subset can be effectively
computed (cf. also [6]).

Here we show that the (actually semi-)algebraicity of A still holds when 
considering not only the free monoid but more generally arbitrary context- 
free languages and bounded semilinear languages. Unfortunately, its effec- 
tive construction is only guaranteed under stricter conditions such as the 
fact that the language is context-free and linear or is bounded semilinear. 
In particular, in the case of context-free languages, we are not able to settle 
the nonlinear case yet. 

We now give a more formal presentation of our work. The free monoid
generated by the finite alphabet $\Sigma$ is denoted by $\Sigma^*$. The
elements of $\Sigma^*$ are words. We consider all finite dimensional
vector spaces as provided with the Euclidean norm $\|\cdot\|$. A quantum
automaton is a quadruple $Q = (s, \varphi, P, \lambda)$ where
$s \in \mathbb{R}^n$ is a row-vector of unit norm, $P$ is a projection of
$\mathbb{R}^n$, $\varphi$ is a representation of the free monoid
$\Sigma^*$ into the group of orthogonal $n \times n$-matrices in
$\mathbb{R}^{n \times n}$ and the threshold $\lambda$ has value in
$\mathbb{R}$. We recall that a real matrix $M$ is orthogonal if its
inverse equals its transpose: $M^{-1} = M^T$. We denote by $O_n$ the
group of $n \times n$-orthogonal matrices. We are mainly interested in
effective properties, which require the quantum automaton to be
effectively given. We say that the quantum automaton is rational if all
the coefficients of the components of the automaton are rational numbers,
i.e., if $\varphi$ maps $\Sigma^*$ into $\mathbb{Q}^{n \times n}$ and
$\lambda \in \mathbb{Q}$. This hypothesis is not a restriction since all
we use for the proofs is the fact that the arithmetic operations and the
comparison are effective in the field of rational numbers. This is the
"measure once" model introduced by Moore and Crutchfield in 2000 [10].
For a real threshold $\lambda$, the languages recognized by $Q$ with
strict and nonstrict threshold $\lambda$ are

\[ |Q_>| = \{ w \in \Sigma^* \mid \|s\varphi(w)P\| > \lambda \}, \qquad
   |Q_\ge| = \{ w \in \Sigma^* \mid \|s\varphi(w)P\| \ge \lambda \} \]
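
The acceptance condition can be illustrated by a minimal sketch that
evaluates $\|s\varphi(w)P\|^2$ exactly for a toy rational automaton. The
one-letter alphabet, the rotation matrix with entries 3/5 and 4/5, the
projection onto the first coordinate and all helper names are assumptions
chosen for this example only; squared norms are compared so that no
square root is needed.

```python
from fractions import Fraction

def vec_mat(v, m):
    """Multiply the row vector v by the matrix m."""
    return [sum(v[i] * m[i][j] for i in range(len(v))) for j in range(len(m[0]))]

def acceptance_sq(s, phi, proj, word):
    """Return ||s * phi(word) * proj||^2 as an exact rational number."""
    v = list(s)
    for letter in word:  # phi is a morphism: phi(uv) = phi(u) phi(v)
        v = vec_mat(v, phi[letter])
    v = vec_mat(v, proj)
    return sum(x * x for x in v)

# Hypothetical toy automaton: a rational rotation is orthogonal
# because (3/5)^2 + (4/5)^2 = 1.
c, d = Fraction(3, 5), Fraction(4, 5)
PHI = {"a": [[c, d], [-d, c]]}
S = [Fraction(1), Fraction(0)]
P = [[Fraction(1), Fraction(0)], [Fraction(0), Fraction(0)]]

def accepts_strict(word, lam):
    """Membership in |Q_>| for a threshold lam >= 0, tested on squares."""
    return acceptance_sq(S, PHI, P, word) > lam * lam
```

With these toy data the word `a` has squared acceptance value 9/25, so it
belongs to $|Q_>|$ for the threshold 1/2, while `aa` does not.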






Blondel et al. in [5] proved that the emptiness problem of $|Q_>|$ is
decidable and the emptiness problem of $|Q_\ge|$ is undecidable. It is
worth remarking that these results are in contrast with the corresponding
ones for probabilistic finite automata. Indeed, for this class of automata
the above mentioned problems are both undecidable (see [12], Thm 6.17).
It is also proven in [8] that both problems remain undecidable for the
"measure many" model of quantum automata, a model not computationally
equivalent to the "measure once" model, introduced by Kondacs and Watrous
in [9].

The decidability result proved in [5] for quantum automata can be
interpreted as saying that the emptiness problem of the intersection of a
language accepted by a quantum automaton and the specific language
$\Sigma^*$ is decidable. In other words, it falls into the category of
issues asking for the decision status of the intersection of two
languages. It is known that such a problem is already undecidable at a
very low level of the complexity hierarchy of recursive languages, namely
for linear context-free languages, to which the Post Correspondence
Problem can be easily reduced.

A few words on the technique used in the above paper. Observe that,
with the natural meaning of the notation $|Q_\le|$, the emptiness problem
for languages $|Q_>|$ is equivalent to the inclusion

\[ \Sigma^* \subseteq |Q_\le| \tag{1} \]

Since the function $M \mapsto \|sMP\|$ is continuous, it is sufficient to
prove that for all matrices $M$ in the topological closure of
$\varphi(\Sigma^*)$ the condition $\|sMP\| \le \lambda$ holds. The
nonemptiness is clearly semidecidable. In order to prove that the
emptiness is semidecidable the authors resort to two ingredients. They
observe that the topological closure of the monoid of matrices
$\varphi(\Sigma^*)$ is algebraic, i.e., when considering the $n \times n$
entries of a matrix $M$ in the topological closure of $\varphi(\Sigma^*)$
as as many unknowns in the field of reals, they are precisely the zeros of
a polynomial in $\mathbb{R}[x_{1,1}, \ldots, x_{n,n}]$. This allows them
to express the property (1) in first-order logic of the field of reals.
The second ingredient consists of applying Tarski-Seidenberg quantifier
elimination and Hilbert basis results, which yields decidability.

We generalize the problem by considering families of languages
$\mathcal{L}$ instead of the fixed language $\Sigma^*$. The question we
tackle is thus the following

(L, Q) INTERSECTION

Input: a language $L$ in a family of languages $\mathcal{L}$ and a finite
quantum automaton $Q$.

Question: does $L \cap |Q_>| = \emptyset$ hold?

Our main result shows that whenever $\mathcal{L}$ is the family of linear
context-free languages or is the family of bounded semilinear languages,
and whenever the automaton is rational, the problem is decidable. This
can be achieved not only because the sets of orthogonal matrices
associated with $L$ are semialgebraic (a more general property than
algebraic, which is defined by more general first-order formulas), but
also because these formulas can be computed "in the limit".

We can prove the semialgebraicity of more general families of languages: 
arbitrary subsemigroups which is a trivial case and context-free languages 
which is less immediate. Finally we show that our main results are not trivial 
since we can exhibit an example of a language which is the complement of 
a context-free language and whose set of matrices is not semialgebraic. 

Some of the results of this paper will be presented at DLT 2013 [4]. 

2 Preliminaries 

A quantum automaton $Q$ is a quadruple $(s, \varphi, P, \lambda)$ where,
as mentioned in the Introduction, $s \in \mathbb{R}^n$ is a vector of unit
norm, $P$ is a projection of $\mathbb{R}^n$, $\varphi$ is a representation
of the free monoid $\Sigma^*$ into the group $O_n$ of orthogonal
$n \times n$-matrices in $\mathbb{R}^{n \times n}$, and
$\lambda \in \mathbb{R}$ is the threshold. The behaviour of $Q$ heavily
depends on the topological properties of the semigroup of matrices
$\varphi(\Sigma^*)$. This is why, before returning to quantum automata, we
first focus our attention on these matrices for their own sake.

2.1 Topology 

The following result is needed in the proof of the main theorem. Though
valid under weaker conditions, it will be considered in the particular
case of orthogonal matrices. Given a subset $E$ of a finite dimensional
vector space, we denote by $Cl(E)$ the topological closure for the
topology induced by the Euclidean norm. Given a $k$-tuple of matrices
$(M_1, \ldots, M_k)$, denote by $f$ the $k$-ary product
$f(M_1, \ldots, M_k) = M_1 \cdots M_k$ and extend the notation to subsets
$\rho$ of $k$-tuples of matrices by posing
$f(\rho) = \{ f(M_1, \ldots, M_k) \mid (M_1, \ldots, M_k) \in \rho \}$.
The following result will be applied in several instances of this paper.
It says that because we are dealing with compact subsets, the two
operators of matrix multiplication and the topological closure commute.

Theorem 1 Let $C$ be a compact subset of matrices and let
$\rho \subseteq C^k$ be a $k$-ary relation. Then we have

\[ Cl(f(\rho)) = f(Cl(\rho)) \]

Proof. Since the function $f$ is continuous, the inverse image of
$Cl(f(\rho))$ is closed, i.e., $Cl(\rho) \subseteq f^{-1}(Cl(f(\rho)))$
holds, which yields $f(Cl(\rho)) \subseteq Cl(f(\rho))$. Now we prove the
opposite inclusion. Consider an element $A \in Cl(f(\rho))$. It is the
limit of a sequence $M_{1,n} \cdots M_{k,n}$ where
$(M_{1,n}, \ldots, M_{k,n}) \in \rho$ for $n \ge 0$. Because $C$ is a
compact set, there exists a subsequence
$(M_{1,n_i}, \ldots, M_{k,n_i}) \in \rho$, i.e., an infinite sequence of
strictly increasing indices $n_i$, which converges to a limit point
$(A_1, \ldots, A_k) \in Cl(\rho)$. By continuity we have
$f(A_1, \ldots, A_k) = A$, which shows that
$Cl(f(\rho)) \subseteq f(Cl(\rho))$. □

Consequently, if $\rho$ is a binary relation which is a direct product
$\rho_1 \times \rho_2$, we have
$Cl(\rho_1 \rho_2) = f(Cl(\rho_1 \times \rho_2))$. It is an elementary
result of topology that
$Cl(\rho_1 \times \rho_2) = Cl(\rho_1) \times Cl(\rho_2)$ holds. Because of
$Cl(\rho_1 \rho_2) = f(Cl(\rho_1 \times \rho_2))
= f(Cl(\rho_1) \times Cl(\rho_2)) = Cl(\rho_1)\,Cl(\rho_2)$ we have

Corollary 1 The topological closure of the product of two sets of matrices
included in a compact subspace is equal to the product of the topological
closures of the two sets.

2.2 Algebraic and semialgebraic sets 

Let us give first the definition of algebraic set over the field of real numbers 

(cf. mm)- 

Definition 1 A subset $A \subseteq \mathbb{R}^n$ is algebraic (over the
field of real numbers), if it satisfies one of the following equivalent
conditions:

(i) $A$ is the zero set of a polynomial
$p \in \mathbb{R}[x_1, \ldots, x_n]$, i.e.,

\[ v \in A \iff p(v) = 0. \tag{2} \]

(ii) $A$ is the zero set of an arbitrary set $\mathcal{P}$ of polynomials
in $\mathbb{R}[x_1, \ldots, x_n]$, i.e., for every vector
$v \in \mathbb{R}^n$,

\[ v \in A \iff \forall p \in \mathcal{P} : p(v) = 0. \tag{3} \]

The equivalence of the two statements is a consequence of Hilbert finite
basis Theorem. Indeed, the theorem claims that given a family
$\mathcal{P}$ there exists a finite subfamily $p_1, \ldots, p_r$
generating the same ideal, which implies in particular that for all
$p \in \mathcal{P}$ there exist $q_1, \ldots, q_r$ with

\[ p = q_1 p_1 + \cdots + q_r p_r \]

Then $p_j(v) = 0$ for $j = 1, \ldots, r$ implies $p(v) = 0$. Now this
finite set of equations can be reduced to the single equation

\[ \sum_{j=1}^{r} p_j(x)^2 = 0 \]

As a trivial example, a singleton $\{v\}$ is algebraic since it is the
unique solution of the equation

\[ \sum_{i=1}^{n} (x_i - v_i)^2 = 0 \]

where $v_i$, with $1 \le i \le n$, is the $i$-th component of the vector
$v$.

It is routine to check that the family of algebraic sets is closed under 
finite unions and intersections. However, it is not closed under complement 
and projection. The following more general class of subsets enjoys extra 
closure properties and is therefore more robust. The equivalence of the two 
definitions below is guaranteed by Tarski-Seidenberg quantifier elimination 
result. 

Definition 2 A subset $A \subseteq \mathbb{R}^n$ is semialgebraic (over
the field of real numbers) if it satisfies one of the two equivalent
conditions

(i) $A$ is the set of vectors satisfying a finite Boolean combination of
predicates of the form $p(x_1, \ldots, x_n) > 0$ where
$p \in \mathbb{R}[x_1, \ldots, x_n]$.

(ii) $A$ is first-order definable in the theory of the structure whose
domain are the reals and whose predicates are of the form
$p(x_1, \ldots, x_n) > 0$ and $p(x_1, \ldots, x_n) = 0$ with
$p \in \mathbb{R}[x_1, \ldots, x_n]$.

We now specify these definitions to square matrices.

Definition 3 A set $A \subseteq \mathbb{R}^{n \times n}$ of matrices is
algebraic, resp. semialgebraic, if considered as a set of vectors of
dimension $n^2$, it is algebraic, resp. semialgebraic.
We now combine the notions of zero sets and of topology. In the following
two results we rephrase Theorem 3.1 of [5] by emphasizing the main
features that serve our purpose (see also [5], [11]). Given a subset $E$
of a group, we denote by $\langle E \rangle$ and by $E^*$ the subgroup and
the submonoid it generates, respectively.

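Theorem 2 Let $S \subseteq \mathbb{R}^{n \times n}$ be a set of orthogonal
matrices and let $E$ be any subset of $S$ satisfying
$\langle S \rangle = \langle E \rangle$. Then we have
$Cl(S^*) = Cl(\langle E \rangle)$. In particular $Cl(S^*)$ is a group.

Proof. It is known that every compact subsemigroup of a compact group is
a subgroup $G$. Now $S^* \subseteq \langle E \rangle$ implies
$G = Cl(S^*) \subseteq Cl(\langle E \rangle)$ and $S \subseteq G$ implies
$Cl(\langle E \rangle) \subseteq G$ and thus
$Cl(S^*) = Cl(\langle E \rangle)$. □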

The main consequence of the next theorem is that the topological closure
of a monoid of orthogonal matrices is algebraic.

Theorem 3 Let $E$ be a set of orthogonal matrices. Then
$Cl(\langle E \rangle)$ is a subgroup of orthogonal matrices and it is the
zero set of all polynomials $p[x_{1,1}, \ldots, x_{n,n}]$ satisfying the
conditions

\[ p(I) = 0 \quad \text{and} \quad p(eX) = p(X) \text{ for all } e \in E \]

Furthermore, if the matrices in $E$ have rational coefficients, the above
condition may be restricted to polynomials with coefficients in
$\mathbb{Q}$.

Proof. It is clear that $Cl(\langle E \rangle)$ is a subgroup of
orthogonal matrices, say $G$. By [11, Thm 5, p. 133] this group is the
zero set of all polynomials $p[x_{1,1}, \ldots, x_{n,n}]$ satisfying the
conditions (where $I$ denotes the identity matrix)

\[ p(I) = 0 \quad \text{and} \quad p(gX) = p(X) \text{ for all } g \in G \tag{4} \]

Let us verify that it suffices to require the above condition for all
$e \in E$. First, if it holds for all $e \in E$, it is satisfied for all
elements of the group $\langle E \rangle$. Now observe that condition (4)
defines a linear constraint on the coefficients of the polynomial. Indeed,
if $V \in \mathbb{R}^d$ is the vector of coefficients of the polynomial
$p$ and if $X$ is viewed as a set of $n^2$ variables, identifying the
coefficients of the monomials of the two sides of (4) yields a system of
linear equations

\[ MV = V \]

where the matrix $M$ depends on $g$ only, say $M = M_g$. Let

\[ \lim_{i \to \infty} g_i = g \quad \text{and} \quad M_{g_i} V = V \text{ for all } i \ge 0 \]

Then by continuity we have $M_g V = V$.

The last assertion concerning the case where the coefficients are rational
can be found in [5, Th. 3.1]. □

Combining the previous two theorems, we get the general result 

Corollary 2 Let $L \subseteq \Sigma^*$. Then $Cl(\varphi(L)^*)$ is algebraic.

2.3 Effectiveness issues 

We now return to the (L, Q) INTERSECTION problem as defined in the
Introduction. We want to prove the implication

\[ \forall X : X \in \varphi(L) \Rightarrow \|sXP\| \le \lambda \]

We observed that, because the function $X \mapsto \|sXP\|$ is continuous,
this implication is equivalent to the implication

\[ \forall X : X \in Cl(\varphi(L)) \Rightarrow \|sXP\| \le \lambda \]

It just happens that under certain hypotheses, $Cl(\varphi(L))$ is
semialgebraic, i.e., it is defined by a first-order formula, which turns
the above statement into a first-order formula. In the simplest examples,
the closure is defined by an infinite conjunction of equations which by
Hilbert finite basis result reduces to a unique equation. Thus Theorem 3
guarantees the existence of the formula but does not give an upper bound
on the finite number of equations which must be tested. Therefore the
following definition is instrumental for the rest of the paper. It conveys
the idea that given a subset $A$ of matrices there exists a sequence of
formulas defining a non-increasing sequence of sets of matrices which
eventually coincides with $A$. Each formula of the sequence can thus be
considered as an approximation of the ultimate formula.

Definition 4 A subset $A$ of matrices is effectively eventually definable
if there exists a constructible sequence of first-order formulas $\phi_i$
satisfying the conditions

1) for all $i \ge 0$, $\phi_{i+1} \Rightarrow \phi_i$

2) for all $i \ge 0$, $A \models \phi_i$

3) there exists $n \ge 0$ such that $B \models \phi_n \Rightarrow B \subseteq A$

The following is a first application of the notion and illustrates the
discussion before the definition.

Proposition 1 Let $Q$ be a rational quantum automaton. Let
$L \subseteq \Sigma^*$ be such that the set $Cl(\varphi(L))$ is
effectively eventually definable. It is recursively decidable whether or
not $L \cap |Q_>| = \emptyset$ holds.

Proof. Equivalently we prove the inclusion $L \subseteq |Q_\le|$. In order
to prove that the inclusion is effective, we proceed as in [5]. We run in
parallel two semialgorithms. The first one verifies the noninclusion by
enumerating the words $w \in L$ and testing if
$\|s\varphi(w)P\| > \lambda$ holds. The second semialgorithm considers a
sequence of formulas $\phi_i(X)$, $i = 0, 1, \ldots$, which effectively
eventually defines $Cl(\varphi(L))$ and verifies whether the sentence

\[ \forall X : \phi_i(X) \Rightarrow \|sXP\| \le \lambda \]

holds, which can be achieved by Tarski-Seidenberg elimination result. If
the inclusion $L \subseteq |Q_\le|$ holds then the first semialgorithm
cannot answer "yes" and the second semialgorithm will eventually answer
"yes". If the inclusion does not hold then the second semialgorithm cannot
answer "yes" for any $i$, since the second condition of Definition 4
implies $X \in Cl(\varphi(L)) \Rightarrow \phi_i(X)$ and thus

\[ \forall X : X \in Cl(\varphi(L)) \Rightarrow \|sXP\| \le \lambda \]

a contradiction. □
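
The control structure of this proof, two semialgorithms run in parallel,
can be sketched as follows. This is only an illustration under strong
assumptions: the quantifier-elimination step is replaced by a stand-in
sequence of rational upper bounds that eventually becomes exact, and the
names `dovetail`, `refuter`, `prover`, `toy_values` and `toy_bounds` are
hypothetical.

```python
from fractions import Fraction
from itertools import count

def dovetail(refuter, prover):
    """Alternate one step of each semi-procedure until one of them succeeds.

    Returns False if the refuter finds a counterexample first,
    True if the prover finds a certificate first.
    """
    for refuted, proved in zip(refuter, prover):
        if refuted:
            return False
        if proved:
            return True
    raise RuntimeError("both semi-procedures stopped without an answer")

def refuter(values, lam):
    """First semialgorithm: look for a word whose value exceeds lam."""
    for v in values:
        yield v > lam

def prover(bounds, lam):
    """Second semialgorithm (stand-in): test the i-th approximation."""
    for b in bounds:
        yield b <= lam

def toy_values():
    # Assumed acceptance values 1 - 2^-n; their supremum is 1.
    return (1 - Fraction(1, 2 ** n) for n in count())

def toy_bounds():
    # Assumed non-increasing bounds 3, 2, 1, 1, ... that become exact.
    return (max(Fraction(1), Fraction(3 - i)) for i in count())

def inclusion_holds(lam):
    return dovetail(refuter(toy_values(), lam), prover(toy_bounds(), lam))
```

For the threshold 1 the stand-in prover succeeds at the third step, while
for the threshold 1/2 the refuter finds the value 3/4 first.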

We state a sufficient condition for a subset of matrices to be effectively 
eventually definable. 

Let $S \subseteq \mathbb{R}^{n \times n}$ be a set of orthogonal matrices
and let $E$ be any subset satisfying
$\langle S \rangle = \langle E \rangle$.

Proposition 2 Let $L \subseteq \Sigma^*$ and let
$E \subseteq \mathbb{Q}^{n \times n}$ be a finite subset of orthogonal
matrices satisfying $\langle \varphi(L) \rangle = \langle E \rangle$. Then
$Cl(\varphi(L)^*)$ is effectively eventually definable.

Proof. Indeed, set $A = Cl(\varphi(L)^*) = Cl(\langle E \rangle)$ where the
last equality is guaranteed by Theorem 2. Then $A$ is the zero set of all
polynomials $p(X)$ where $p$ satisfies the condition

\[ p(I) = 0 \quad \text{and} \quad p(gX) = p(X) \text{ for all } g \in A \]

Since it clearly suffices to verify the invariance of $p$ under the action
of the finite set of generators, we proceed as follows. We enumerate all
polynomials $p \in \mathbb{Q}[x_{1,1}, \ldots, x_{n,n}]$, say
$p_0, p_1, \ldots$. For each such polynomial $p$ the invariance relative
to the action of each generator can be tested. Thus the formula

\[ \phi_i(X) \equiv \bigwedge_{j=1}^{i} p_j(X) = 0 \]

effectively eventually defines $A$: the first two conditions can be
readily verified and the last one is a consequence of Hilbert finite basis
theorem on ideals of polynomials. □
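
A small worked instance may help to see what the invariant polynomials
look like. The choice $n = 2$ and $E = \{J\}$, with $J$ a quarter-turn
rotation, is an assumption made purely for illustration; here the group
generated by $J$ is finite and therefore already closed.

```latex
% Assumed example: n = 2 and E = \{J\}, a quarter-turn rotation.
J = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, \qquad
X = \begin{pmatrix} a & b \\ c & d \end{pmatrix}, \qquad
JX = \begin{pmatrix} c & d \\ -a & -b \end{pmatrix}.
% Each polynomial below vanishes at X = I and is unchanged under
% the substitution (a, b, c, d) -> (c, d, -a, -b), i.e. p(JX) = p(X):
p_1 = a^2 + c^2 - 1, \quad p_2 = b^2 + d^2 - 1, \quad p_3 = ab + cd, \quad
p_4 = ad - bc - 1, \quad p_5 = a^2 c^2.
% Common zeros: p_1 = p_5 = 0 forces (a, c) \in \{(\pm 1, 0), (0, \pm 1)\};
% then p_2 = p_3 = p_4 = 0 determine (b, d), which leaves exactly
\{ I,\; J,\; -I,\; -J \} = \langle J \rangle = \mathrm{Cl}(\langle J \rangle).
```

In this example five invariant polynomials already cut out the closure, so
the conjunction of the corresponding equations is a formula of the kind
that the enumeration eventually reaches.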



2.4 Closure properties 



In this paragraph we investigate some closure properties of the three differ- 
ent classes of matrices: algebraic, semialgebraic and effectively eventually 
definable, under the main usual operations as well as new operations. 

We define the sandwich operation denoted by $\circ$ whose first operand is
a set of pairs of matrices
$A \subseteq \mathbb{R}^{n \times n} \times \mathbb{R}^{n \times n}$ and
whose second operand is a set of matrices
$B \subseteq \mathbb{R}^{n \times n}$ by setting

\[ A \circ B = \{ XYZ \mid (X, Z) \in A \text{ and } Y \in B \} \]

The next operation will be used. Given a bijection

\[ \pi : \{ (i,j) \mid i,j \in \{1, \ldots, n\} \} \to \{ (i,j) \mid i,j \in \{1, \ldots, n\} \} \tag{5} \]

and a matrix $M \in \mathbb{R}^{n \times n}$, denote by $\pi(M)$ the matrix
$\pi(M)_{i,j} = M_{\pi(i,j)}$. Extend this operation to subsets of matrices
$A$. Write $\pi(A)$ to denote the set of matrices $\pi(M)$ for all
$M \in A$.

The last operation is the sum of square matrices $M_1, \ldots, M_k$ whose
result is the square block matrix

\[ M_1 \oplus \cdots \oplus M_k =
   \begin{pmatrix}
     M_1 & 0 & \cdots & 0 \\
     0 & M_2 & \cdots & 0 \\
     \vdots & \vdots & \ddots & \vdots \\
     0 & 0 & \cdots & M_k
   \end{pmatrix} \tag{6} \]

These notations extend to subsets of matrices in the natural way. Here we
assume that all $k$ matrices have the same dimension $n \times n$. Observe
that if the matrices are orthogonal, so is their sum. Such matrices form a
subgroup of orthogonal matrices of dimension $kn \times kn$.
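
The observation that a sum of orthogonal matrices is again orthogonal can
be checked on a small case. The sketch below builds the block matrix with
exact rational arithmetic; the helper names `direct_sum` and
`is_orthogonal` and the two sample matrices are assumptions made for the
example.

```python
from fractions import Fraction

def direct_sum(*blocks):
    """Place square matrices along the diagonal of a larger zero matrix."""
    size = sum(len(b) for b in blocks)
    out = [[Fraction(0)] * size for _ in range(size)]
    offset = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, x in enumerate(row):
                out[offset + i][offset + j] = Fraction(x)
        offset += len(b)
    return out

def is_orthogonal(m):
    """Check M * M^T = I exactly (rows are orthonormal)."""
    n = len(m)
    return all(
        sum(m[i][k] * m[j][k] for k in range(n)) == (1 if i == j else 0)
        for i in range(n)
        for j in range(n)
    )

# Two sample orthogonal matrices: a rational rotation and a coordinate swap.
ROT = [[Fraction(3, 5), Fraction(4, 5)], [Fraction(-4, 5), Fraction(3, 5)]]
SWAP = [[0, 1], [1, 0]]
BLOCK = direct_sum(ROT, SWAP)  # a 4 x 4 block-diagonal matrix
```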

Logic provides an elegant way to formulate properties in the present
context. Some conventions are used throughout this work. E.g., we write
$\exists^n X$ when we mean that $X$ is a vector of $n$ bound variables.
Furthermore, a vector of $n \times n$ variables can be interpreted as an
$n \times n$ matrix of variables. As a consequence of Tarski-Seidenberg
result, consider two semialgebraic subsets of matrices, say $A_1$ and
$A_2$, defined by two first-order formulas $\phi_1(X_1)$ and
$\phi_2(X_2)$ where $X_1$ and $X_2$ are two families of $n^2$ free
variables viewed as two $n \times n$ matrices of variables. Then the
product

\[ A_1 A_2 = \{ M_1 M_2 \mid M_1 \in A_1, M_2 \in A_2 \} \]

is defined by the following formula where $X$ is a family of $n^2$ free
variables viewed as an $n \times n$ matrix

\[ \exists^{n \times n} X_1 \, \exists^{n \times n} X_2 :
   X = X_1 X_2 \wedge \phi_1(X_1) \wedge \phi_2(X_2) \]

where $X = X_1 X_2$ is an abbreviation for the predicate defining $X$ as
the matrix product of $X_1$ and $X_2$. This proves that the product of two
semialgebraic sets of matrices is semialgebraic. Similarly we have the
following closure properties whose verification is routine.

Proposition 3 Let $A_1, A_2 \subseteq \mathbb{R}^{n \times n}$ be two sets
of matrices and let $\pi$ be a one-to-one mapping as in (5).

1) If $A_1$ and $A_2$ are algebraic so are $A_1 \cup A_2$ and $\pi(A_1)$.

2) If $A_1$ and $A_2$ are semialgebraic, resp. effectively eventually
definable, so are $A_1 \cup A_2$, $A_1 A_2$ and $\pi(A_1)$.

Proposition 4 Let
$A_1 \subseteq \mathbb{R}^{n \times n} \times \mathbb{R}^{n \times n}$ and
$A_2 \subseteq \mathbb{R}^{n \times n}$ be semialgebraic, resp.
effectively eventually definable. Then $A_1 \circ A_2$ is semialgebraic,
resp. effectively eventually definable.

Proposition 5 Let $A$ be a semialgebraic, resp. effectively eventually
definable, set of $kn \times kn$ matrices of the form (6). The set

\[ \{ X_1 \cdots X_k \mid X_1 \oplus \cdots \oplus X_k \in A \} \]

is semialgebraic, resp. effectively eventually definable.

Proposition 6 If $A_1, \ldots, A_k \subseteq \mathbb{R}^{n \times n}$ are
semialgebraic, resp. effectively eventually definable, sets of matrices
then so is the set $A_1 \oplus \cdots \oplus A_k$.

3 Context-free languages



For the sake of self-containment and in order to fix notation, we recall the 
basic properties and notions concerning the family of context-free languages 
which can be found in all introductory textbooks on theoretical computer 
science (see, for instance, [7]). 

A context-free grammar $G$ is a quadruple $(V, \Sigma, P, S)$ where
$\Sigma$ is the alphabet of terminal symbols, $V$ is the set of
nonterminal symbols, $P$ is the set of rules, and $S$ is the axiom of the
grammar. A word over the alphabet $\Sigma$ is called terminal. As usual,
the nonterminal symbols are denoted by uppercase letters $A, B, \ldots$.
A typical rule of the grammar is written as $A \to \alpha$. The derivation
relation of $G$ is denoted by $\Rightarrow$ and its reflexive and
transitive closure by $\Rightarrow^*$.

A grammar is linear if every right hand side $\alpha$ contains at most one
occurrence of nonterminal symbols, i.e., if it belongs to
$\Sigma^* \cup \Sigma^* V \Sigma^*$.
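
The linearity condition is purely syntactic and easy to test. The
following sketch assumes a hypothetical encoding of a grammar as a list of
`(left, right)` pairs, where `right` is a string of one-character symbols;
the two sample grammars are illustrative only.

```python
def is_linear(rules, nonterminals):
    """Return True if every right-hand side has at most one nonterminal."""
    return all(
        sum(1 for symbol in right if symbol in nonterminals) <= 1
        for _left, right in rules
    )

# Palindromes over {a, b}: S -> aSa | bSb | a | b | (empty word).
PALINDROMES = [("S", "aSa"), ("S", "bSb"), ("S", "a"), ("S", "b"), ("S", "")]

# Well-nested words: S -> SS | aSb | (empty word); the rule S -> SS is not linear.
NESTED = [("S", "SS"), ("S", "aSb"), ("S", "")]
```

Here `is_linear(PALINDROMES, {"S"})` holds, whereas the rule `S -> SS`
makes the second grammar nonlinear.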

The idea of the following notation is to consider the set of all pairs of left 
and right contexts in the terminal alphabet of a self-embedding nonterminal 
symbol. In the next definition, the initial "C" is meant to suggest the term 
"context" as justified by the following. 

Definition 5 With each nonterminal symbol $A \in V$ associate its terminal
contexts defined as

\[ C_A = \{ (\alpha, \beta) \in \Sigma^* \times \Sigma^* : A \Rightarrow^* \alpha A \beta \}. \]

It is convenient to define the sandwich operation also for languages in
the following way. With $C_A$ as above and $L'$ an arbitrary language, we
define

\[ C_A \circ L' = \{ uwv \mid (u, v) \in C_A \text{ and } w \in L' \} \]

As the proof of the main theorem proceeds by induction on the number of
nonterminal symbols, we need to show how to recombine a grammar from
simpler ones obtained by choosing an arbitrary non-axiom symbol as the
new axiom and by canceling all the rules involving $S$. This is the reason
for introducing the next notation.

Definition 6 Let $G = (V, \Sigma, P, S)$ be a context-free grammar. Set
$V' = V \setminus \{S\}$.

For every $A \in V'$, define the context-free grammar
$G_A = (V', \Sigma, P_A, A)$ where the set $P_A$ consists of all the rules
$B \to \gamma$ of $G$ of the form

\[ B \in V', \quad \gamma \in (V' \cup \Sigma)^* \]

and denote by $L_A$ the language of all terminal words generated by the
grammar $G_A$.
The next definition introduces the language of terminal words obtained in
a derivation where $S$ occurs at the start only.

Definition 7 Let $L'(G)$ denote the set of all the words of $\Sigma^*$
which admit a derivation

\[ S \Rightarrow \gamma_1 \Rightarrow \cdots \Rightarrow \gamma_\ell \Rightarrow w \tag{7} \]

where, for every $i = 1, \ldots, \ell$, $\gamma_i \in (V' \cup \Sigma)^*$.

The language $L'(G)$ can be easily expressed in terms of the languages
$L_A$ for all $A \in V'$. Indeed, consider the set of all rules of the
grammar $G$ of the form

\[ S \to \beta, \quad \beta \in (V' \cup \Sigma)^* \tag{8} \]

Factorize every such $\beta$ as

\[ \beta = w_1 A_1 w_2 A_2 \cdots w_\ell A_\ell w_{\ell+1} \tag{9} \]

where $w_1, \ldots, w_{\ell+1} \in \Sigma^*$ and
$A_1, A_2, \ldots, A_\ell \in V'$. The following is a standard exercise.

Lemma 1 With the notation of (9), the language $L'(G)$ is the (finite)
union of the languages

\[ w_1 L_{A_1} w_2 L_{A_2} \cdots w_\ell L_{A_\ell} w_{\ell+1} \]

when $\beta$ ranges over all rules (8).

Proposition 7 With the previous notation $L$ is a finite union of
languages of the form $C_S \circ L''$ where

\[ L'' = w_1 L_{A_1} w_2 L_{A_2} \cdots w_\ell L_{A_\ell} w_{\ell+1} \]

Proof. In order to prove the inclusion of the right- into the left-hand
side, it suffices to consider $w = \alpha u \beta$, with $u \in L'(G)$ and
$(\alpha, \beta) \in C_S$. One has $S \Rightarrow^* u$ and
$S \Rightarrow^* \alpha S \beta$ and thus
$S \Rightarrow^* \alpha S \beta \Rightarrow^* \alpha u \beta$.

Let us prove the opposite inclusion. A word $w \in L$ admits a derivation
$S \Rightarrow^* w$. If the symbol $S$ does not occur in the derivation
except at the start of the derivation, then $w \in L'(G)$. Otherwise
factor this derivation into
$S \Rightarrow^* \alpha S \beta \Rightarrow^* w$ such that $S$ does not
occur in the second part of the derivation except in the sentential form
$\alpha S \beta$. Reorder the derivation $\alpha S \beta \Rightarrow^* w$
into $\alpha S \beta \Rightarrow^* \gamma S \delta \Rightarrow^* w$ so
that $\gamma, \delta \in \Sigma^*$. This implies $w = \gamma u \delta$ for
some word $u \in L'(G)$, completing the proof. □
4 The main results



Here we prove that the problem is decidable for two families of languages,
namely the linear context-free languages and the bounded semilinear
languages.

4.1 The bounded semilinear languages

We solve the easier case. We recall that a bounded semilinear language is
a finite union of linear languages, which are languages of the form

\[ L = \{ w_1^{n_1} \cdots w_k^{n_k} \mid (n_1, \ldots, n_k) \in R \} \tag{10} \]

for some fixed words $w_i \in \Sigma^*$ for $i = 1, \ldots, k$, where
$R \subseteq \mathbb{N}^k$ is a linear set, i.e., there exist
$v_0, v_1, \ldots, v_p \in \mathbb{N}^k$ such that

\[ R = \{ v_0 + \lambda_1 v_1 + \cdots + \lambda_p v_p \mid \lambda_1, \ldots, \lambda_p \in \mathbb{N} \} \]
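
To make the definition concrete, the following sketch enumerates a finite
portion of a linear set and the corresponding words of the form (10). The
truncation parameter `bound` and the function names are assumptions made
for the example; the actual sets are infinite.

```python
from itertools import product

def linear_set(v0, periods, bound):
    """Vectors v0 + sum(l_i * v_i) with every coefficient l_i in range(bound)."""
    result = set()
    for coeffs in product(range(bound), repeat=len(periods)):
        vec = tuple(
            v0[j] + sum(l * v[j] for l, v in zip(coeffs, periods))
            for j in range(len(v0))
        )
        result.add(vec)
    return result

def bounded_words(words, vectors):
    """Words w_1^{n_1} ... w_k^{n_k} for each exponent vector (n_1, ..., n_k)."""
    return {"".join(w * n for w, n in zip(words, vec)) for vec in vectors}

# With w_1 = a, w_2 = b, v_0 = (0, 0) and v_1 = (1, 1) one gets words a^n b^n.
SAMPLE = bounded_words(("a", "b"), linear_set((0, 0), [(1, 1)], 3))
```

With these choices `SAMPLE` contains the empty word, `ab` and `aabb`.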

Proposition 8 If $L$ is bounded semilinear then its closure
$Cl(\varphi(L))$ is semialgebraic. Furthermore, if the quantum automaton
$Q$ is rational, the (L, Q) intersection is decidable.

Proof. Because the semialgebraic sets are closed under finite union, it
suffices to consider the case where the language is of the form (10). For
$t = 0, \ldots, p$ set $v_t = (v_{t,1}, \ldots, v_{t,k})$ and consider the
orthogonal matrices

\[ g_t =
   \begin{pmatrix}
     \varphi(w_1)^{v_{t,1}} & & & \\
     & \varphi(w_2)^{v_{t,2}} & & \\
     & & \ddots & \\
     & & & \varphi(w_k)^{v_{t,k}}
   \end{pmatrix} \]

Set $G = \{ g_i \mid i = 1, \ldots, p \}$. In virtue of Theorem 2 and
Proposition 2 the set $Cl(G^*)$ is semialgebraic and it is effectively
eventually definable if the coefficients of the quantum automaton are
rational. By Corollary 1 we have

\[ Cl(g_0 G^*) = g_0 \, Cl(G^*) \]

and by Proposition 3 this product is semialgebraic (resp. effectively
eventually definable if the coefficients of the quantum automaton are
rational). By Proposition 5, $Cl(\varphi(L))$ is semialgebraic (resp.
effectively eventually definable if the coefficients of the quantum
automaton are rational). In the latter case the (L, Q) intersection is
decidable by Proposition 1, which completes the proof. □
4.2 The case of context-free languages

Here we show that $Cl(\varphi(L))$ is effectively eventually definable for
languages generated by linear grammars and rational quantum automata.

We adopt the notation from Section 3 for context-free grammars. We recall
the following notion that will be used in the proof of the next result
(see [13]). A subset of a monoid $M$ is regular if it is recognized by
some finite $M$-automaton, which differs from an ordinary finite
nondeterministic automaton over the free monoid by the fact that the
transitions are labeled by elements of $M$.

Proposition 9 If $L$ is generated by a context-free grammar, then
$Cl(\varphi(L))$ is semialgebraic. Furthermore, if the grammar is linear
and if the quantum automaton is rational then $Cl(\varphi(L))$ is
effectively eventually definable and the (L, Q) intersection is decidable.

Proof. With the notation of Section 3 the language $L$ is a finite union
of languages of the form $C_S \circ L''$ with

\[ L'' = w_1 L_{A_1} w_2 L_{A_2} \cdots w_\ell L_{A_\ell} w_{\ell+1} \tag{11} \]

where, for every $1 \le i \le \ell + 1$, $w_i \in \Sigma^*$ and, for every
$1 \le i \le \ell$, $A_i \in V'$. It suffices to show by induction on the
number of nonterminal symbols that, with the previous notation, the
subsets

\[ Cl(\varphi(C_S \circ L'')) \tag{12} \]

are semialgebraic in all cases and effectively eventually definable when
the quantum automaton is rational and the grammar of the language is
linear. As a preliminary remark let us show this property for
$Cl(\varphi(C_S))$. Define
$\varphi^T : \Sigma^* \to \mathbb{R}^{n \times n}$ as
$\varphi^T(u) = \varphi(u)^T$ and set

\[ M = \{ \varphi(a) \oplus \varphi^T(b) \mid (a, b) \in C_S \}. \]

Observe that $M$ is a monoid since if $\varphi(a) \oplus \varphi^T(b)$ and
$\varphi(c) \oplus \varphi^T(d)$ are in $M$ then we have

\[ \varphi^T(b)\varphi^T(d) = \varphi(b)^T \varphi(d)^T
   = (\varphi(d)\varphi(b))^T = \varphi(db)^T = \varphi^T(db) \]

which yields

\[ (\varphi(a) \oplus \varphi^T(b))(\varphi(c) \oplus \varphi^T(d))
   = \varphi(ac) \oplus \varphi^T(db). \]

As a first consequence, by Corollary 2, $Cl(M)$ is algebraic. Furthermore
we can show that $Cl(M)$ is effectively eventually definable. Indeed $M$
is a regular submonoid of the group of orthogonal matrices
$O_n \oplus O_n$ if the grammar is linear. Precisely, it is recognized by
the finite $O_{2n}$-automaton whose states are the nonterminal symbols,
whose transitions are of the form

\[ A \xrightarrow{\ \varphi(a) \oplus \varphi^T(b)\ } B \]

where $A \to aBb$ is a rule of the grammar, and whose initial and final
states coincide with $S$. Now, the subgroup generated by a regular subset
of a monoid has an effective finite generating set [1] (see also [13]) and
thus by Proposition 2, $Cl(M)$ is effectively eventually definable if
$\varphi(\Sigma^*) \subseteq \mathbb{Q}^{n \times n}$.

We now proceed with the proof by induction on the number of nonterminal
symbols. If the set of nonterminal symbols is reduced to $S$ then $L$ is
reduced to $C_S \circ L'(G)$ and $L'(G)$ is finite. We may further assume
that there is a unique terminal rule $S \to w$. By Theorem 1 we have

\[ Cl(\varphi(L)) = \{ X \varphi(w) Y^T \mid X \oplus Y \oplus \varphi(w) \in Cl(M \oplus \varphi(w)) \} \]

By Corollary 1 we have

\[ Cl(M \oplus \varphi(w)) = Cl(M) \oplus Cl(\varphi(w)) = Cl(M) \oplus \varphi(w) \]

which, by Proposition 6, is semialgebraic, resp. effectively eventually
definable. In the latter case the (L, Q) intersection is decidable.

Now assume $V$ contains more than one nonterminal symbol. We first prove
that for each nonterminal symbol $A$, $Cl(\varphi(C_S \circ L_A))$ is
semialgebraic in the general case and effectively eventually definable
when the grammar is linear and the quantum automaton is rational. By
Theorem 1 and Corollary 1, $Cl(\varphi(C_S \circ L''))$ is the subset

\[ Cl(\varphi(C_S \circ L'')) = \{ XZY^T \mid X \oplus Y \oplus Z \in Cl(M) \oplus Cl(\varphi(L'')) \} \]

with $L''$ as in (11), i.e.,

\[ \{ XZY^T \mid X \oplus Y \oplus Z \in Cl(M) \oplus
   Cl(\varphi(w_1)\varphi(L_{A_1}) \cdots \varphi(w_\ell)\varphi(L_{A_\ell})\varphi(w_{\ell+1})) \} \]

By Corollary 1 we have

\[ Cl(\varphi(w_1)\varphi(L_{A_1}) \cdots \varphi(w_\ell)\varphi(L_{A_\ell})\varphi(w_{\ell+1}))
   = \varphi(w_1)\,Cl(\varphi(L_{A_1})) \cdots \varphi(w_\ell)\,Cl(\varphi(L_{A_\ell}))\,\varphi(w_{\ell+1}) \]

which shows, via Proposition 3 and by induction hypothesis, that this
subset is semialgebraic, resp. effectively eventually definable. Then its
direct sum with $Cl(M)$ is semialgebraic, and effectively eventually
definable if the grammar is linear and the quantum automaton is rational.
We conclude by applying Proposition 5. □






5 Complement of context-free languages 



In this section we prove that there is a language $L$ such that (i) the
complement of $L$ is context-free and (ii) $Cl(\varphi(L))$ is not
semialgebraic.

Given a binary representation of a real $\alpha = 0.b_1 b_2 \ldots$ we
define its approximation sequence $(\alpha[k])_{k \ge 0}$ as the sequence
of its successive truncations $\alpha[k] = 0.b_1 \ldots b_k$.

Lemma 2 Let $0 < \alpha < 1$ be an irrational. There exist infinitely many
rationals $\frac{p}{n}$ such that

\[ \left| \alpha[1 + 2\ell(n)] - \frac{p}{n} \right| < \frac{1}{n^2} \]

holds, where $\ell(n) = \lfloor \log_2 n \rfloor$.

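Proof. By the triangle inequality we have

$$\left|\alpha[1 + 2\ell(n)] - \frac{p}{n}\right| \le \left|\alpha[1 + 2\ell(n)] - \alpha\right| + \left|\alpha - \frac{p}{n}\right|$$

By the definition of the approximation sequence we get

$$\left|\alpha[1 + 2\ell(n)] - \alpha\right| < \frac{1}{2^{1 + 2\ell(n)}} = \frac{1}{2} \times \frac{1}{2^{2\lfloor \log_2 n \rfloor}} \le \frac{1}{2} \times \frac{1}{n^2}$$

Now by Hurwitz's Theorem there exist infinitely many rationals
$\frac{p}{n}$ for which

$$\left|\alpha - \frac{p}{n}\right| < \frac{1}{\sqrt{5}} \times \frac{1}{n^2}$$

We conclude by combining these last two inequalities, since
$\frac{1}{2} + \frac{1}{\sqrt{5}} < 1$. □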



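We now fix an irrational $0 < \alpha < 1$. Consider the orthogonal matrix

$$M_\alpha = \begin{pmatrix} \cos 2\pi\alpha & \sin 2\pi\alpha \\ -\sin 2\pi\alpha & \cos 2\pi\alpha \end{pmatrix}$$

and the morphism $\varphi_\alpha : b^* \to O_2$ from the free monoid
generated by the letter $b$ to the group $O_2$, defined by
$\varphi_\alpha(b) = M_\alpha$. Furthermore set

$$L(\alpha) = \left\{ b^n \;\middle|\; \exists q \in \mathbb{N} : \left|\alpha[1 + 2\ell(n)] - \frac{q}{n}\right| < \frac{1}{n^2} \right\} \qquad (13)$$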



Lemma 3 If $0 < \alpha < 1$ is an irrational, the topological closure
$Cl(\varphi_\alpha(L(\alpha)))$ is not semialgebraic.

Proof. Since the element in position (1, 1) of the matrix
$\varphi_\alpha(b^n)$ is $\cos 2\pi n\alpha$ and since the projection of a
semialgebraic set is semialgebraic, it suffices to show that

$$Cl(\{\cos 2\pi n\alpha \mid b^n \in L(\alpha)\})$$

is not semialgebraic.

Observe that $n \ne n'$ implies $\cos 2\pi n\alpha \ne \cos 2\pi n'\alpha$
since $\alpha$ is irrational. Since $L(\alpha)$ is infinite by Lemma 2,
the set $\{\cos 2\pi n\alpha \mid b^n \in L(\alpha)\}$ is infinite.

Now we verify that 1 is the unique limit point. Indeed, by definition
$b^n \in L(\alpha)$ implies that for some integer $q$ we have

$$\left|\alpha[1 + 2\ell(n)] - \frac{q}{n}\right| < \frac{1}{n^2}$$

For such an integer $q$ we have

$$\left|\alpha - \frac{q}{n}\right| \le \left|\alpha - \alpha[1 + 2\ell(n)]\right| + \left|\alpha[1 + 2\ell(n)] - \frac{q}{n}\right| < \frac{1}{2n^2} + \frac{1}{n^2} = \frac{3}{2} \times \frac{1}{n^2}$$

Consequently, $|n\alpha - q| < \frac{3}{2} \times \frac{1}{n}$. Now we compute

$$1 > \cos 2\pi n\alpha = \cos 2\pi(n\alpha - q) = \cos 2\pi|n\alpha - q| > \cos\frac{3\pi}{n}$$

which proves that the closure
$Cl(\{\cos 2\pi n\alpha \mid b^n \in L(\alpha)\})$ consists of a unique
limit point and of infinitely many isolated points. This is not a
semialgebraic set, since the semialgebraic subsets of the reals are finite
unions of points and intervals. □
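The accumulation at 1 can be observed numerically. The sketch below is
only an illustration under stated assumptions: it takes the sample value
$\alpha = \sqrt{2} - 1$, reads the membership test of $L(\alpha)$ with
threshold $\frac{1}{n^2}$ and $q$ ranging over $1, \ldots, n$, and uses
hypothetical helper names (`truncation`, `ell`, `in_l_alpha`,
`cosine_values`). It lists, for each $b^n \in L(\alpha)$ up to a limit,
the value $\cos 2\pi n\alpha$ next to the lower bound
$\cos\frac{3\pi}{n}$ from the proof, without asserting anything beyond
the values it prints.

```python
import math
from fractions import Fraction
from math import isqrt

ALPHA = math.sqrt(2) - 1  # sample irrational, an assumption of this sketch


def truncation(k: int) -> Fraction:
    """alpha[k] for alpha = sqrt(2) - 1, computed exactly."""
    return Fraction(isqrt(2 * 4**k) - 2**k, 2**k)


def ell(n: int) -> int:
    """floor(log2(n)) for n >= 1."""
    return n.bit_length() - 1


def in_l_alpha(n: int) -> bool:
    """Membership of b^n in L(alpha), with q ranging over 1..n."""
    a = truncation(1 + 2 * ell(n))
    return any(abs(a - Fraction(q, n)) < Fraction(1, n * n)
               for q in range(1, n + 1))


def cosine_values(limit: int):
    """Triples (n, cos(2*pi*n*alpha), cos(3*pi/n)) for b^n in L(alpha), n <= limit."""
    return [(n, math.cos(2 * math.pi * n * ALPHA), math.cos(3 * math.pi / n))
            for n in range(1, limit + 1) if in_l_alpha(n)]


if __name__ == "__main__":
    for n, value, bound in cosine_values(200):
        print(f"n={n:4d}  cos(2*pi*n*alpha)={value:.6f}  cos(3*pi/n)={bound:.6f}")
```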



Now we state the main result of this section. 



Theorem 4 There is a language $L \subseteq \Sigma^*$ and a morphism
$\varphi : \Sigma^* \to \mathbb{Q}^{n \times n}$, assigning an orthogonal
matrix to every word of $\Sigma^*$, such that (i) $L$ is the complement of
a context-free language and (ii) the topological closure $Cl(\varphi(L))$
is not semialgebraic.

Proof. Consider a one-tape Turing machine implementing the following
procedure for recognizing the language $L(\alpha)$ defined in (13):

    Input $b^n$
    $A \leftarrow \alpha[1 + 2\ell(n)]$
    $F \leftarrow 0$
    for $q = 1$ to $n$, if $|A - \frac{q}{n}| < \frac{1}{n^2}$ then $F \leftarrow 1$
    if $F = 1$ then write $ab^n$,
        position the head on the rightmost occurrence of $b$,
        change to a new state $q$, move the reading head to the leftmost
        cell while staying in state $q$
    stop when reaching the occurrence $a$.
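The arithmetic part of this procedure can be transcribed directly. The
following Python sketch is not the Turing machine itself; it only mirrors
the computation of $A$ and of the flag $F$. It assumes the sample value
$\alpha = \sqrt{2} - 1$, the threshold $\frac{1}{n^2}$, and an initial
flag value of 0. The names `truncation`, `ell` and `recognizes` are
hypothetical.

```python
from fractions import Fraction
from math import isqrt


def truncation(k: int) -> Fraction:
    """alpha[k] for the sample value alpha = sqrt(2) - 1, computed exactly."""
    return Fraction(isqrt(2 * 4**k) - 2**k, 2**k)


def ell(n: int) -> int:
    """floor(log2(n)) for n >= 1."""
    return n.bit_length() - 1


def recognizes(n: int) -> bool:
    """Return True when the procedure sets F = 1 on input b^n."""
    a = truncation(1 + 2 * ell(n))      # A <- alpha[1 + 2 l(n)]
    flag = False                        # F <- 0 (assumed initial value)
    for q in range(1, n + 1):           # for q = 1 to n
        if abs(a - Fraction(q, n)) < Fraction(1, n * n):
            flag = True                 # F <- 1
    return flag


if __name__ == "__main__":
    print([n for n in range(1, 100) if recognizes(n)])
```

Exact rational arithmetic is used so that the strict inequality is
decided without rounding error.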

We know that the computation histories of a Turing machine, i.e., the
set of sequences of configurations properly separated by a new symbol, is,
as a language, the intersection of two linear context-free languages (see,
for instance, [7], Lemma 8.6). Let $\mathrm{Hist}(b^n)$ be the history
associated to the input $b^n$. Let $\Gamma$ be the disjoint union of the
symbols comprising the input and tape alphabets along with the states,
including the special state $q$. With $\alpha = \arctan\frac{4}{3}$ we get
the orthogonal matrix

$$M_\alpha = \begin{pmatrix} \frac{3}{5} & \frac{4}{5} \\ -\frac{4}{5} & \frac{3}{5} \end{pmatrix}$$

Define the morphism $\varphi : \Gamma^* \to O_2$ by

$$\varphi(c) = \begin{cases} I & \text{if } c \in \Gamma \setminus \{q\} \\ M_\alpha & \text{if } c = q \end{cases}$$

By applying the result mentioned above to the Turing machine implementing
the procedure for recognizing the language $L(\alpha)$, we have that there
exist two linear context-free languages $L_1$ and $L_2$ such that

$$\mathrm{Hist}(L(\alpha)) = L_1 \cap L_2 = \overline{\overline{L_1} \cup \overline{L_2}}$$

Since $L_1$ and $L_2$ are linear and deterministic context-free, their
complements are context-free, so $\overline{L_1} \cup \overline{L_2}$ is a
(not necessarily deterministic) context-free language and thus
$\mathrm{Hist}(L(\alpha))$ is the complement of a context-free language.
But then

$$\varphi(\mathrm{Hist}(L(\alpha))) = \{M_\alpha^n \mid b^n \in L(\alpha)\}$$

We conclude by applying Lemma 3. □
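The role of the morphism can be checked on small inputs. The Python
sketch below assumes the matrix with entries $\pm\frac{3}{5}$ and
$\pm\frac{4}{5}$ as reconstructed above and represents the special state
by the string `"q"`. The names `mat_mul`, `transpose` and `phi` are
hypothetical helpers. It verifies with exact rational arithmetic that
the matrix is orthogonal and that the image of a word depends only on
the number of occurrences of the special state.

```python
from fractions import Fraction

# Rational rotation matrix (assumed entries 3/5 and 4/5).
M_ALPHA = ((Fraction(3, 5), Fraction(4, 5)),
           (Fraction(-4, 5), Fraction(3, 5)))
IDENTITY = ((Fraction(1), Fraction(0)),
            (Fraction(0), Fraction(1)))


def mat_mul(a, b):
    """Product of two 2x2 matrices given as tuples of rows."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def transpose(a):
    """Transpose of a matrix given as a tuple of rows."""
    return tuple(zip(*a))


def phi(word, special="q"):
    """Image of a word: the special symbol maps to M_ALPHA, all others to I."""
    result = IDENTITY
    for symbol in word:
        if symbol == special:
            result = mat_mul(result, M_ALPHA)
    return result


if __name__ == "__main__":
    print(mat_mul(M_ALPHA, transpose(M_ALPHA)) == IDENTITY)
    print(phi(["a", "b", "q", "b", "q"]))
```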



References 

[1] A. V. Anisimov and F. D. Seifert. Zur algebraischen Charakteristik der
durch Kontext-freie Sprachen definierten Gruppen. Elektron. Inform.
Verarb. u. Kybern. 11, 695-702, 1975.

[2] S. Basu, R. Pollack, and M. -F. Roy. Algorithms in Real Algebraic Ge- 
ometry. Springer, Berlin, 2003. 






[3] A. Bertoni, C. Mereghetti, and B. Palano. Quantum Computing: 1-Way
Quantum Automata, in Proceedings of Developments in Language Theory,
DLT 2003, Lecture Notes in Computer Science, Vol. 6224, pp. 1-20,
Springer, Berlin, 2003.

[4] A. Bertoni, C. Choffrut, and F. D'Alessandro. Quantum finite automata 
and linear context-free languages, DLT 2013, 17th International Con- 
ference on Developments in Language Theory, 18th-21st of June 2013, 
Paris-Est, France. 

[5] V. D. Blondel, E. Jeandel, P. Koiran, and N. Portier. Decidable and 
Undecidable Problems about Quantum Automata. SIAM J. Comput. 
34, 1464-1473, 2005. 

[6] H. Derksen, E. Jeandel, and P. Koiran. Quantum automata and algebraic 
groups. J. Symb. Comput. 39, 357-371, 2005. 

[7] J. Hopcroft and J. D. Ullman. Introduction to Automata Theory,
Languages and Computation. Addison-Wesley, 1979.

[8] E. Jeandel. Indecidabilite sur les automates quantiques. Master's thesis. 
ENS Lyon, 2002. 

[9] A. Kondacs, J. Watrous. On the power of quantum finite state automata. 
Proceedings of the 38th Annual Symposium on Foundations of Computer 
Science, pp. 66-75, 1997. 

[10] C. Moore, and J. Crutchfield. Quantum automata and quantum gram- 
mars. Theoret. Comput. Sci. 237, 275-306, 2000. 

[11] A. Onishchik and E. Vinberg. Lie Groups and Algebraic Groups. 
Springer, Berlin, 1990. 

[12] A. Paz. Introduction to Probabilistic Automata. Academic Press, New 
York, 1971. 

[13] J. Sakarovitch. Elements of Automata Theory. Cambridge University 
Press, Cambridge, 2009. 






